Code Similarity on High Level Programs
نویسندگان
چکیده
This paper presents a new approach for code similarity on High Level programs. Our technique is based on Fast Dynamic Time Warping, that builds a warp path or points relation with local restrictions. The source code is represented into Time Series using the operators inside programming languages that makes possible the comparison. This makes possible subsequence detection that represent similar code instructions. In contrast with other code similarity algorithms, we do not make features extraction. The experiments show that two source codes are similar when their respective Time Series are similar.
منابع مشابه
Soft Computing Approach to Performance Analysis of Parallel and Distributed Programs
This paper describes a novel approach to performance analysis for parallel and distributed systems that is based on soft computing. We introduce the concept of performance score representing the performance of code regions that is based on fuzzy logic. We propose techniques for fuzzy-based performance classification. A novel high-level query language is designed to support the search for perfor...
متن کاملEffects of Code Obfuscation on Android App Similarity Analysis
Code obfuscation is a technique to transform a program into an equivalent one that is harder to be reverse engineered and understood. On Android, well-known obfuscation techniques are shrinking, optimization, renaming, string encryption, control flow transformation, etc. On the other hand, adversaries may also maliciously use obfuscation techniques to hide pirated or stolen software. If pirated...
متن کاملAssessment of genetic diversity in Pythium aphanidermatum isolates using ISSR and rep-CR methods
Sixty isolates of Pythium aphanidermatum as the causal agent of sugar beet root rot were selected on the basis of their geographical origins, morphological and genetic diversity studies. Pathogenicity test using the Kruskal-Wallis analysis showed significant differences between the pathogenic potential of isolates. Based on the growth rate, isolates were categorized in two groups with low and h...
متن کاملAutomatic Rootcausing for Program Equivalence Failures in Binaries
Equivalence checking of imperative programs has several applications including compiler validation and cross-version verification. Debugging equivalence failures can be tedious for large examples, especially for low-level binary programs. In this paper, we formalize a simple yet precise notion of verifiable rootcause for equivalence failures that leverages semantic similarity between two progra...
متن کاملAn evaluation of duplicate code detection using anti-unification
This paper describes an algorithm for finding software clones, which works at the level of abstract syntax trees and is thus conceptually independent of the source language of the analyzed programs. We use a notion of clones which captures replacement of subtrees in the program AST, and is formally based on the notion of anti-unification. This allows us to capture syntactic structural similarit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/0710.5547 شماره
صفحات -
تاریخ انتشار 2007